Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues

نویسندگان

Tobias Heinroth

Maximilian Grotz

Florian Nothdurft

Wolfgang Minker

چکیده

In this paper we present three approaches towards adaptive speech understanding. The target system is a model-based Adaptive Spoken Dialogue Manager, the OwlSpeak ASDM. We enhanced this system in order to properly react on non-understandings in real-life situations where intuitive communication is required. OwlSpeak provides a model-based spoken interface to an Intelligent Environment depending on and adapting to the current context. It utilises a set of ontologies used as dialogue models that can be combined dynamically during runtime. Besides the benefits the system showed in practice, real-life evaluations also conveyed some limitations of the model-based approach. Since it is unfeasible to model all variations of the communication between the user and the system beforehand, various situations where the system did not correctly understand the user input have been observed. Thus we present three enhancements towards a more sophisticated use of the ontology-based dialogue models and show how grammars may dynamically be adapted in order to understand intuitive user utterances. The evaluation of our approaches revealed the incorporation of a lexical-semantic knowledgebase into the recognition process to be the most promising approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System

Spoken dialogue system performance can vary widely for different users, as well for the same user during different dialogues. This paper presents the design and evaluation of an adaptive version of TOOT, a spoken dialogue system for retrieving online train schedules. Adaptive TOOT predicts whether a user is having speech recognition problems as a particular dialogue progresses, and automaticall...

متن کامل

Jaspis^2 - an architecture for supporting distributed spoken dialogues

In this paper, we introduce an architecture for a new generation of speech applications. The presented architecture is based on our previous work with multilingual speech applications and extends it by introducing support for synchronized distributed dialogues, which is needed in emerging application areas, such as mobile and ubiquitous computing. The architecture supports coordinated distribut...

متن کامل

Subjective experiments on influence of response timing in spoken dialogues

To verify the validity of analysis results relating to dialogue rhythm from earlier studies, we produced spoken dialogues based on analysis results relating to response timing and the other spoken dialogues, and performed subjective experiments to investigate parameters such as the naturalness of the dialogue, the incongruity of the synthesized speech, and the ease of comprehension of the utter...

متن کامل

System Architectures for Speech-based and Multimodal Pervasive Computing Applications

Speech-based and multimodal interaction can be very efficient and natural way for human-computer communication in pervasive computing settings. The key features in these settings are the distributed and adaptive nature of interaction. In order to implement applications efficiently the system architecture must support these features. In this paper we discuss the requirements for speech-based per...

متن کامل

Mining Spoken Dialogue Corpora for System Evaluation and Modelin

We are interested in the problem of modeling and evaluating spoken language systems in the context of human-machine dialogs. Spoken dialog corpora allow for a multidimensional analysis of speech recognition and language understanding models of dialog systems. Therefore language models can be directly trained based either on the dialog history or its equivalence class (or cluster). In this paper...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues

نویسندگان

چکیده

منابع مشابه

Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System

Jaspis^2 - an architecture for supporting distributed spoken dialogues

Subjective experiments on influence of response timing in spoken dialogues

System Architectures for Speech-based and Multimodal Pervasive Computing Applications

Mining Spoken Dialogue Corpora for System Evaluation and Modelin

عنوان ژورنال:

اشتراک گذاری